Reliability: on the reproducibility of assessment data.

نویسنده

  • Steven M Downing
چکیده

CONTEXT All assessment data, like other scientific experimental data, must be reproducible in order to be meaningfully interpreted. PURPOSE The purpose of this paper is to discuss applications of reliability to the most common assessment methods in medical education. Typical methods of estimating reliability are discussed intuitively and non-mathematically. SUMMARY Reliability refers to the consistency of assessment outcomes. The exact type of consistency of greatest interest depends on the type of assessment, its purpose and the consequential use of the data. Written tests of cognitive achievement look to internal test consistency, using estimation methods derived from the test-retest design. Rater-based assessment data, such as ratings of clinical performance on the wards, require interrater consistency or agreement. Objective structured clinical examinations, simulated patient examinations and other performance-type assessments generally require generalisability theory analysis to account for various sources of measurement error in complex designs and to estimate the consistency of the generalisations to a universe or domain of skills. CONCLUSIONS Reliability is a major source of validity evidence for assessments. Low reliability indicates that large variations in scores can be expected upon retesting. Inconsistent assessment scores are difficult or impossible to interpret meaningfully and thus reduce validity evidence. Reliability coefficients allow the quantification and estimation of the random errors of measurement in assessments, such that overall assessment can be improved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Investigating the Reliability and Reproducibility of the International Caries Detection and Assessment System Index in Evaluation of Dental Decay in 25- to 40-Year-Old People and Comparing it with the Decayed, Missing, Filled Index

Background The Decayed, Missing, Filled (DMF) index is one of the most common techniques for investigating dental decay. In recent years, the new International Caries Detection and Assessment System  (ICDAS) technique has been introduced to the world as a standard tool to confront the present challenges in diagnosing dental decay. The present study aimed to investigate the reliability and repr...

متن کامل

ارزیابی پایایی و تکرارپذیری پرسش‌نامه بسامد مصرف غذایی و شناسایی الگوهای غذایی غالب در بزرگسالان دارای اضافه وزن و چاق تبریز

Background and purpose: This study was aimed to assess the reliability and reproducibility of a designed food frequency questionnaire (FFQ) and to determine the major dietary pattern of overweight and obese adults in Tabriz, Iran. Materials and methods: The study included two studies: (1) a pilot study (n = 30) assessment of reliability and reproducibility of FFQ, (2) a cross-sectional study (...

متن کامل

Comparison of Double and Single Leg Weight-Bearing Radiography in Determining Knee Alignment

  Background: Knee malalignment is an important modifiable cause of osteoarthritis (OA). Surgical therapeutic procedures depend on proper knee alignment assessment. The purpose of this study was to compare knee alignment parameters between double and single leg weight-bearing radiographs and to evaluate the reproducibility of inter- and intra-observer measurements. Methods: One hundred eight p...

متن کامل

بررسی قابلیت تکرارپذیری سه روش مختلف اندازه‌گیری عرض لثه کراتینیزه

Background and Aim: Although the need for "adequate" amount of keratinized tissue (KT) for periodontal health is questionable, the mucogingival junction (MGJ) often serves as a measurement landmark in periodontal evaluations. Limited information is available on the reproducibility of KT width (KTW) assessment. The purpose of this study was to assess reproducibility of 3 different methods to ide...

متن کامل

Quality assessment of conventional X-ray diagnostic equipment by measuring X-ray exposure and tube output parameters in Great Khorasan Province, Iran

Introduction: Regular implementation of quality control (QC) program in diagnostic X-ray facilities may affect both image quality and patient radiation dose due to the changes in exposure parameters. Therefore, this study aimed to investigate the status of randomly selected conventional radiographic X-ray devices installed in radiology centers of Great Khorasan Province, Iran, to produce the da...

متن کامل

Surface Electromyographic Assessment of Swallowing Function

The reliability of surface electromyographic (sEMG) variables during swallowing determines the potential usefulness of these measures in swallowing assessment and treatment. This study aimed to establish the reliability of the sEMG measures of the swallowing function of muscles during different swallowing conditions in healthy young and old volunteers. Two groups of volunteers (24 older adults,...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Medical education

دوره 38 9  شماره 

صفحات  -

تاریخ انتشار 2004